首页> 外文OA文献 >A Framework for Enriching Lexical Semantic Resources with Distributional Semantics
【2h】

A Framework for Enriching Lexical Semantic Resources with Distributional Semantics

机译:用分布式语言丰富词汇语义资源的框架   语义

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present an approach to combining distributional semantic representationsinduced from text corpora with manually constructed lexical-semantic networks.While both kinds of semantic resources are available with high lexicalcoverage, our aligned resource combines the domain specificity and availabilityof contextual information from distributional models with the conciseness andhigh quality of manually crafted lexical networks. We start with adistributional representation of induced senses of vocabulary terms, which areaccompanied with rich context information given by related lexical items. Wethen automatically disambiguate such representations to obtain a full-fledgedproto-conceptualization, i.e. a typed graph of induced word senses. In a finalstep, this proto-conceptualization is aligned to a lexical ontology, resultingin a hybrid aligned resource. Moreover, unmapped induced senses are associatedwith a semantic type in order to connect them to the core resource. Manualevaluations against ground-truth judgments for different stages of our methodas well as an extrinsic evaluation on a knowledge-based Word SenseDisambiguation benchmark all indicate the high quality of the new hybridresource. Additionally, we show the benefits of enriching top-down lexicalknowledge resources with bottom-up distributional information from text foraddressing high-end knowledge acquisition tasks such as cleaning hypernymgraphs and learning taxonomies from scratch.
机译:我们提出了一种将文本语料库中的分布语义表示与人工构建的词义语义网络相结合的方法。虽然两种语义资源都具有很高的词法覆盖率,但我们的对齐资源却结合了领域特异性和分布模型中上下文信息的可用性,且简洁明了手工制作的词汇网络的质量。我们从诱导性词汇术语的分布表示开始,这些术语与相关词汇项给出的丰富上下文信息相伴。然后,我们会自动消除此类表示的歧义,以获得完整的原型概念化,即归纳出的词义的类型化图表。最后,将这种原始概念化与词汇本体对齐,从而产生混合对齐资源。此外,未映射的诱导感觉与语义类型相关联,以便将它们连接到核心资源。针对我们方法的不同阶段针对真实性判断进行的人工评估以及基于知识的单词SenseDisambiguation基准的外部评估均表明了这种新型混合资源的高质量。此外,我们展示了使用自下而上的文本分发信息来丰富自上而下的词汇知识资源的好处,这些信息可用于解决高端知识获取任务,例如清理超音符和从头开始学习分类法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号